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1 

^mmu. »i^itti2i (WT, rsc A2 j t i^o zttffo&) (om 
sii^coc dn AHffr. %*l#=i— K-r Ki^-rs-. 

sca2ii ^m&v*uni&&mmmmzBwz^z-z>* i%$-&m±m 

Sjfi. • $2^U<f*£l&fc (dentator ubra I -pa I I i do I uys i an atr 

ophy (DRPLA) ) ^ 6 fl^KD^^tt^El^fe^r . JSEHlfc^ttC A Gfrblt 

SCA 2 IZO^T^ JISil^TrliC A G (DlS^^ti^ Ltt^^t 
0££*lTl^& (Trottier, Y. et al. Nature. 378, 403-406 (1995)) „ 

**W0>B ftfi it»E5iJ^^E^tLf-> S C A 2 (DJlSfifcT tf> c DN AUt 
C <t T'fc€>o ^fc. #$680) § Ml** SCA2 <7)>S^fct<*M(D 

tc&b<D'&&mtLx&mu, sca 20iisiifi^i^so^-c^$^^ / 5' 

SCA 2 <7);&^l3*f S C A 2 <Dig@iIfo¥-<J>7 >^-t2 >X&JH&t-£C £ 

"C (fc •S o 

y h<7)#, UiiL^^ii^LTL>S. 2. 5 k b (7) 7sp El U ^<7)S|1 
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2 

tlsU?W'<Xt&* l )3X / 7^*^ V&ZTu-yt Ltffl^tthcDNA 
4 -j ,j _ £ x . j _ - y L T 'Z *i h 2 MH(D y P - 3 ' <75M^ ty \ A U V 

Ltl^tthcDNA7^7 l J-$7{; l J--^U C(D^-^'i:/W 
^'J^f X«c DN Aif^r$^^P-->^Lf-o :ilb<Dc DN A»#0> 

^iif i]<DMiDc(DiI»iiB9J ^J^ST & tz&blZ, £ £|C R A CE (rapid amp I if icat 
ion of cDNA ends) £frfcl\ £bf~5' §lj<D3ii$PCD&Sie?'l &tzMZ 
R T-P CR £fir£l\ S C A 2 (DlSHilfe^f tf> c DN A(D±$1TO^SIS?'J ^ 

■ % 1 6 6 # Bfcl* L m i 8 8 #@ <D G.I n 5 ~ 1 0 0 <D|H| T; E 

m^mmcDmrnmn -iz i y a — k £ *i«5 t 5 / #ib^j » 

XLT^^gllR^PlS^; 15 b p lil±CD+t-<X$^1-^T>^-iz>X«^ 

fcSgBJIClJ; y , ^SiE?iJ^^ $ tLfc, S C A 2 <D&m%Lfc=¥-to c D N ABUT* 
^mtt^titco ##§B;3<D*2i£@ifTtt lc J; y p- K£ti&* >/<?RteS C A 2 CD;& 
JSlZffll^ d L, SC A2(DMiRU'illi-t^Sft^5S W: 

■t&tz&bCDftL&mt LXm^Z>Z\ttft!$Z> a ££1^ fcfBllHf::* 1 ^ SCA 2 
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i i it. *§£W(Dmmm\zte^T&j££titz. *mwc dn akt^^sis 

A 2|t©IIltfc^ 

i6lt W CO H »j IC t# htltzfS ~V 0 DN ASfT^Tspl RXS Ts 

p2 IfeDClC S C A 2 c D NAW X\ ^S&^IPSM^^^'-^ 
c D N AgftJtCO+K'f XSra^tlffe^o 

El'7li, (cag) 55 ^p-^^mt^-Citl^H/c. sE^B=fMscA2i&B 

f (D C A G SI*I40)IS!I(D»^ i ^ fe^o 

±iecocb^i^. *§£W<mmmK\z. mmommm i^wts^i 

IE*ij£=i- K^^MMM^^to fcf£ U 11 6 6f@^^Ll 1 8 8#@<7) 
Gin CD^JSLMl 5-1 O OCOW-e^lb-r-So fcfc* Z CDf§ L J ii L&l*> 11 

SAtitr 5 -25 flu-efc y . -75. s c A2if-eii3 5 ~ 1 o oii-eMo 

lcfc^T3ll^lc^$tircit*iBMI«5iJ#-^l £.1/0 1 ~4I-;f£*i6o 
JS£E5UA<if<B J: 9 I- LT;^^ tlfc^. &tz. C(D^Sie5^^i)C DN A 
^SCA 2(DJl@iteT(Dc D N AUfc^CtliTtS^^JI^^^^"^ 1 ^" 
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(D-d. thcDNA7-f^7U-^iLtfflt^PCRiaW^ 
l:fO)PCRli^^P-^i L^W^'J Sf-f-tf-va <7P--> 

> ^teicfl^ t# £>*i x. ^ -7 3 -T??I ±« ^ fl^S & - £ & 

-SCI £A<t?£ 5. Z(D.J;5^Jl*P<D^I*^iX.I^ D- M. Glover. DN 
A Cloning Volume I I I a practical approach, 1987. IRL Press KfEi££*lT 

15 Lfc O PCR (LA PCR) ([S]#tf ^ Ifil^Sii Vol41 . No. 5. 585-594, 1 

996) £ffl^rii>!ii£-ti\ mus^^ummmmmL, z^-mm^y^^z k 

K <7^--efc-g, pGEX (77iRv7ttg) (7)-7;Ut c ^P-->^uI^AL 
T ^>r-v3>U #btLf^fi^^^^^-t:^fli@D H 5 (GIBC0 BRL 

20 > tf i> 1 J >»1$) <7)^K$5}3M*£31fR . l£ff2M$i&*fcfr ^ SC A 2 <7)lS@ite 

iSWSfiKtts GST (Glutathion S-Transf erase) t <7)i4n" SSH <t UT^IH 
-f &<7)T\ rti^^GST ^ (77JRv7ttI) £mi^#^K&fcii^£ W 

25 M^'AOite^ ( G A G'J t°- h# 15-2 50) |C»0'^rM^* tlfc ^ > 

s c a 2 j^zs^-rs - t icdt v s c a 2^mLJL^0mx^mm^ith 
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^t- gag y t°- hizj:or : =i— K£*i<£>*"y :s i^m^m^x, ^<d&£ 

*^BJI~^ y. SCA2 it^it^-C, 

C A 2 S^CDMsl^ >/^KI*MSA£>tC7)J:y C A G V tf- htf&^fttz 

'^t-afe^JSflRi: Lttt, Utt SCA2 »<^£^±<t IE 1 *? SCA2 jt^<D2»A<D 

25 <7>?P «y 5 v tf^i Ltlj:, mJi*<DJ; ? fc, , V >e h PT:i#btltc S C A 
2jIfcTj£^<W^^«^ tf- htf> 

s c a 2m%\z&^T\z. gigtisswnmmi-ftm? * 
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Ay T? « F>3 ^ # A f & ^ t i> "51 V & & o 

^^'-j^v x L-c*o)ffliR^ias-r £ ta> -e& L J < < £ * 1 s t> P wax 

T>^-b>X(D+^' XI* 1 5 b p U±. tmmXCD^- m®G>±& 

yT (D^-f x - <t l<> & y l<i£ 5 o b P ui±* =j- km 

R N A^^aPXIi^nP^^^f-^l^iW^i^SiE^J L "C l N/ a> ^ <t L l x 7b\ 
£fcl*T'/W ^U^>fX^#^ff(tc7)^^t4^^LTLNS^^t^^^$Bffl^ 
lz£3:;fc&o. $il©7>f -b >x§ sc A 2 ltl:S4t ^ - 1 1- W > ^ 

<ds c a 2 Bisae** ? □ wtz> z a^i;«^**^ 

iiSi*fi I k g , 1 *§£W<DT>^lz >X&B*£0. 001 mmo! -1000 mmol 

H^^-iLtm^ mwamu. m*ES*v<>i.i2 no. is caron 199 

4, pp. 114-121 ; ffl^atB*^ Ultg^ Vo l. 12 No. 15 (iiflj) 1994, pp. 34-40) . 
"Cft*. a^iciil^^TTV^I-^XSSDNA^EIaCD-^^ E3£fX« 
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(1) (CAG) 5S z?n — -J<DfR® 

C A GSam<i$ 5 O ■ (dentatorubra 

l-pal I idotuysian atrophy (DRPLA) ) Sfc^ (Koide, R. et al. . Nature Genet. 
, 6, 9-13 (1994)) jD R P L Altro^V = 7^ DN A^biiS L> 
10 5 K^<7 £ -pT7Blue T (p-2093) \ZVJ ' V n--^<?Ltz,> p-2093 I** (GAG) 55 
SU:f(DSS(D77 >*>?iE$»J£^L\. "tfc:b*>* 5" -CAC CAC CAG CAA CAG C 
AA (GAG) 55 CAT CAC GGA AAC TCT GGG CC-3' VB. £*L&ffi*IJ U 
13 51 0 U^T K^-f^ — > -fUt>*>> 5' -CAC CAC CAG CAA CAG CAA CA-3' 
5'- tf^^-GGC CCA GAG TTT CCG TGA TG-3 JSL^ P C R^fitfof:, P C 
15 Rli. 10 mM Tris-HCI, pH 8.3. 50 mM KCI . 1.5 mM MgC 1 2 , 2M N, N, N- h *) > * 
>lU-y 1 J V>> 0. 1 rtiM TTP, 0. 1 mM dCTP, 0. 1 mM dGTP. 9. 25 MBq (D [ a - 32 P]dAT 
' . P (222 TBq/mmol), 0.5 ^ M fOfl)#^7^7- 0.3 ngCD^XS KD N A 
(p-2093) 7£tf2 .0 U<7)Taq D N Atf 'J * 7— tf ( 0 W^SS^^ 

) £^fc£fi1 6/i \ (D®i&zm^x'nitz„ ^#)i::9 4°ct-2#P«11£<to 

20 fcf, 9 4°C, 1 ■ftCDgte, 5 4°a 1 »<D7--U>>?> 7 2°C. 3#<B#ft 

£3 o+r-f ^juffi^ ««l=: 7 2tti o^M^frofco 

2 0/. I (DXhL/^hTfv^^ilLfcHblf-X' (Dynabeads M-280. Str 
eptavidin;/;i^x-, Dynal AS *±§D £ffl^T-*i*(CAG) 5!) 

25 l ; 5mM Tris-HCI (pH7. 5) v O. 5mM EDTAW'IM N a C 1 

4 0/i I (7)jg^-ee-X^;5fe;tLfco ^t5 0^ \<DO. 1M NaOHill 
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mmti.m*9\ 22, 43. 5 1 x-^^tis Fn^y^?*-^*^ 1 ^ 

^y^^-otztZh. (CAG) 55 ^P-^ltCAGfitW4 3ao;5 1 

'J ?^Xl±lt\ 9HCDiie j: 5 1 i:li±< ^-f^'J ^^X'L^of;(K. Sanpei et 
al. .Biochemical and Biophysical Research Communications. Vol.212, No. 2. 
1995, pp. 341-346). tUt>^. Z(DZf d-^^m^hlt. /W?'U $V^-V 

(2) s c a 2 mi^oy^&Mim^ 
15 1-1^ s c a 2 &mom*M%7Kt 0 z.<Dmzk®\zz>^x. mmmn. 

DNA (#15/i g) £1 0 0 UCDTspEI ( S ^S^IRJt^>^^^ttM) 

4 -tf— V 3 2.75 x SSPE (1 x SSPE =150 mM NaCI, 10 mM NaH 2 P0 4 , 1 mM 

EDTA) , 50% 7\s)\sJ+T £ Kv. 5 x t-*>/ \;U^ ;gu 100 ng/ml m^^r'^ DN 
AS.tK(CAG) 5b ^P-? (6 x 10 6 cpm/ml) ^^^ 6 2°C-C 1 8 ^fir 
otco /W^'/^-f-ti— V3 0. 5% SDS I x SSC (150 mM NaCI, 15 

mM^X>it^hU^A) "CSI^e 5°C-C0. 5B#|«#Lfco MSif 
^ x /7 ij — Kodak Bio Max MS ? -< )l J*\Z 1 6 B^sl - V 0 > 

2.5 kbp (DTspEI ir^^ttS^tltc, 

ZZX\ -AO) SCA2|f (El 5 *<D7 ) ^ ^ D.N A &1$&lz£ 



15 



20 
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yjfttbu *<B2 7 o n g^TspEi vmitL, TJSn-x?)ism%b%wuzfrii 

tz 0 ±3ELfc2.5 kb (7) TspE I ffirtt ££#«/f / S 'V * D N A »ftf£ , EcoRI 
■ ."??flfl2!iL7VA ZAPII. (Stratagene — |C? P — Lfco 

>$f ^ > 3 >l* 5 x SSC, 1 x =ry/\)\,V : M. 1 O o/oMt^* h 5 2 
10 OmM'jylt 4 0 0^g/m l(Dt hlfe$DNAMTsp1^D- 

i^tfjIZO. 1 x SSC-0. 1% SDS'Ha-CS 2°C"eO. 5 B#^;5t)# Lt-„ MSigl&T^U 
— HI £ Kodak Bio Max MS T7 WJU-MZL- 7 0°Ct'2 4 B#|HI*— V 

^-^7^ iz^it^o -to)^; .^mm*LmB&&*to , f; 5^^-^ 

15 T SP 2 ^M$tl7=o 

%tz. Tsp2 (7) Smal-Apal ir>t (630 bp) <7>i&*ia*'J£ U C A Glf i 
^^tt&eitTl- 1 ;^^^ btf KF-1 (5--CCC TCA CCA TGT CGC TGA AGC-3 
' ) ®. R — 1 (5 -CGA CGC TAG AAG GCC GCT G-3' ) ZW& ttz (i1 o fc 

h^Bia^Hc D N A^ ^^ 'J - (Stratagene *±SH) JtU Utf KF 

20 -UtlR-1 ^^P-^'i: LtfflL^tT.^ ■j-^^Lfco /\-T ^' 'J ^W-tf — 
va >(^ 6 x SSC, 10 x =t^i\)\,VWl. 0.5% SDS, 0.05% t'P'J >iW- HJ ^ 
As 100 ng/ml |5Wf^ Vffi^ D N A&t;^««^ ^ ^ n ~^* 

#<tr;§;£<£Tr 5 5 °C-C 1 8 Mffofc c /n^ ^ U -iz- v a !!£lt^ltt 
IC 0. 5% SDS 0. 05% t°P'J >ithU^Am6 x SSC ^55^0. 

25 5 B#ffl£$ L fc„ [5575 <£> T^P-^^W 4.0 kb (D c D N A >7 

D-> Fcl A<&t>*lfco Fc1 , Tspl £. IS Tsp2 (Dt*»I5$iJ $^ t> tt$ ^ L tc ^ 

fcfc, Tspl SlU Tsp2 (7)*]IS»iifeBS tftf K Fc1 SU'm^^lfffr^-^^ 
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WiKZ^a-zft LXm^xm^Ot hcDNA7f^7'J- (t h«£M, 
t UteiaBiv t 'MS&tft MBift) $X^'j-->^-r«>-i:l=«fcy» c D N A 
?P->Fc2 s Fbl4, B4. C6 &tfd..9 £M£Lfc (IH6#BB) 0 Fd 0 5 ' 3fc 

5 iffi£[^£^&7i#)[^ 5' -RACE-Ready cDNA (Clonetech, Palo Alto, CA, USA) £ 
ffilN-C 5" -RACE ^'Ulfz (Frohman, M. A. et al, Proa Natl. Acad. Sci. USA 
85. 8998-9002 (1988)) „ -J=> 4 7- R - 1 £1f? 1<7> P C R iCffllV ^7-f7 
-R-2 (5 -,CTT GCG GAC ATT GGC AGC C-3' , m 1 #88) £B 2 <B P C R (Cffl 
lV/c 0 ftfc. forward m<DZ??4^-t LTI^lTtl^l^ F - 1 (Ml #P V 

10 ) ^ffl^fro 350 bp <D 5- -RACE j&ifo (5R1) £ pT7Blue T ^ Q- 
■ ( P T78iue T-vector(5R1) 0 5R1 (D® -ffii, Fcl, Tspl &lPTsp2 

ot^mmmtoTi—^-^vy^^^mmLtzo c dna<d3' mmzmfc-tz 

fzthK. t hBia^btttii^tlfr poly(A)'mRNA(7)1 n r %$%%lt LXMIK ^7 
_ i 3 (5' -TTC TCT CAG CCA AAG GCT TCT ACT ACC-3' 0 3#1) £ 
15 ^<^-<t LXffl^T3' -RACE Zfttiotio mhtltz 1300 bp <D 3' -RACE it 
!|$D (3RD £ pT7Blue T $ ' -[ZV L tz (pT7Blue T-vector (3R 

: d)o '.■ ■ ■■ . 

c DNA(7)5* M^^^^lCfllK^tc^lZ. j£|E?PCR (RT-PCR) £<T&o 
fco -?Ut>1i. t MlfrbCD*- K^v-^^^m^tlfi^RN A^^ RNas 
20 e (Promega *±M) XMitLtc (Onodera. 0. et al. , Am. J. 

Hum. Geent. 57, 1050-1060(1995)) 0 l$E?5f5li, 2 
O pmol (7D^>^A^^r-9-7-^ffil\ 4 2°CXftU-?tz (Onodera, 0. et al. , 
fllfg) o PCRCDy'y^'^-t LTli. F1006(5'-TAT CCG CAG CTC CGC TCC C- 
3 N mi #P§) S.l>*R1002(5 , -AGC CGG GCC GAA ACG CGC CG-3' , HI ^ 
25 ffl^t.o 5 pmol fO^^-fT- 10 mMTrisHCI (pH8. 3) , 50 mM KCI , 1. 
5 mM MgCI 2 . 1.7M N,N,N- h U /fJ^'J vX 200 » M i*O0> dATP, dCTP & 
Dt TTP \ 100 a< M (7) dGTP, 100 M (7) 7 — T : T"tf dGTP &Xt 2. 5 U (7) Taq tKU 
^ 7— tf (tSiittM) £#^1: 2 O V I (70;g;&4^ P C R £fr.£ o ^ 9 6 
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7 2 p ci ^fffl<D#fiifiAvt)fiR*-9-'r^^*3 oiaiiyjSL, 

■2^-e5^ra#«-r'<5^<!:lcd:y PC.R$trftofc. *-<7)*Sg:, 5R1 CD 
±;jfj~ 246 bp MlK-S. ^7P->5R1^lfc (!H6#I) „ 

fcfc, E6tK T SP 1 Slj: Tsp2 »f)t^atttfl)^l*S O A 2 c D N A.*l: 
#£-f &®i*£*-f:. S C A 2 c DNA*<ft£&#<D^li=I- K1HDc£ 

7ikt 0 c AG&mmo)imizmmv<D*;vo xv^ztir^Zo t sp ei cd , 

Not | (Nk Sac II (Sk Sau3AI (Sa) % Sma I (Sm) . Eco52l (E52h Apa I (Apk 
Accl (Ac) v BaraHl (B) , Xho I (X) . EcoR I (E) Rlf Pst I . (P) (DUMmmW 
itLtffikZtlX^&o #c D N A^P->W XS(/W3 >t >tXS C A 
2 c DNAC0T!C*cf^tL\|)„ 

* 0 Is*? K^x-V* — 5 * — V 3 >;£ (Sanger. F. et al . Proc. Nat I. Ac 
ad. Sci. USA 74. 5463-5467 (1977) ; Chen E. Y. et al. DNA 4. 165-170 (1985) 

) i- * y d n A(D-*m<Di&&mm z&m Ltz, cag sw&t/f^m 

M(D&mW\Z&3£i-Z>t~#>^ tt*>teCtz F-1 St/RS-l (5'-CCT CGG TGT 
CGC GGC GAC TTC C-3' ) fcfflt^r P C R U , CAGS«ti^V = 

^Mtt£i^SLf::o 0. 25 /i M-fO^^^-r^-V 1° mM Tr is HGI ( P H8. 3) . 
50 mM KCI. 2.0 mM MgCI 2 . 1 . 7M N, N.N- h U^fJ^'J v 200 ^ M iTO 

<7> dNTP. 200 ng (D^V £ «y 2 DNA. #.l>* 1 . 25 U (7) Taq > =7—M (^>I 

itttM) ^#fc^«2 5 „ I (7) PC R^T^o/-o 9 5°Cf2»W 

^i4^^T^o7t^ 9 5°ci ^m<Dm^m. e 2°ci #fm<7>T--u 

7 2°C1 ^(7)#^X|i7bN^fife^-9--<^^^ 3 2 : lSl«fty 5B:U. *&U:7 2°C-e5 

±lE&c DN A? P-XDit»lB5Ul^a^^ ^'JAfH^t4 3 5 1 
b pC03>t>^SCA2cDNAffi5U^Ufc (|E5lJ#^ 1 0 1 ~4, 
IU6 #BB) 0 !B5'l#-§- 1 5)4 3 5 2 n t ~ 4 3 6 7 n t U A^f-;b£ 
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JiiLfccDNA^n-Vefc^CI 9. B 4&l>'3 R 1 |-£^"C. IH~tf>tefi 
|- U A t— ;uj!)<#^-9 u 6 t A<5tS £ *lfc 0 
Sgjfeffij 2 ^#tj3(DCAGSIiiiiS(I3iS 

^ ^ -7_ F -i &XSR - 1 zmi^xmhtitzP cRii^t; u v s 

10 mM Tris-HCI, pH 8. 3, 50 mM KCI, 2. 0 mM MgClj. 1 . 7 M N, N, N- h 'J > 
<7"'Jv>s 111KBq0>[ a- 32 P]dCTP (111 Tbq/mmol) ... 30 n M dCTP. 200 m 
M(D dATP. dGTP TTP, & 0, 25 /* M.flDJtlE 2 200 ng (75 

= 9 D N Aifet>*!C 1 . 25 U (7) Taq DNA ^ 'J / 1 O n I 0) 

;§;^-e^otc 0 9 5°C V 2^«13l£4to7^ 9 5°C V 1 #ra<D£i«L 

6 0°C, 1 ^(DT-- 'J >9'RlS7 2°C. 1 flfaKOfafi^ *> £ ? ^£ 3 

2 QMS UIl , »IC7 2 °C. 5 M C7)frS £ fro fco S C A 2 ' fifc* □ ~ 
>-(t$tif=^V = ^M^J^T^btif^ ffl* <7)+M X(DC A GSPli^ 

^frlB^^*— (sequence I adder) X^ — 1) — t L"Cfflt N fc 0 C ..;AGS«._ 

Mm^lCC A A ^#t;iE^4^lLiSfeT-C0li^l^!i. C A A J|M£l*C A 

GM-tM X|-#^7to C A Gfim^ii^: LTL^ S C A 2 *t:£iHE?-0>*£» 
CAGM^S^^±IS^AlB5t]liCAGMm(D^^Xllli^^^^o7t 0 

±IH0*5£J::i y . H^A ( 2 8 6^) TSiXS 1 0»SC A 2 lf ( 3 
4 fi(D S C A2ffe^) iCifcl* & C A G J£1gmte& £ ;1'I5£ L tc<> H 7 IC tr 

^^^^SCAai^l^tNt^^jF^ 

T-efe^CDlZ*fL. S C A 2115^1^1 3 5l@m±Trfco fc Q * o T\ ±8E 
ICJ; tj £ tLfc c D N A lis SCA2 <fl 0 [*Hl<5?-<D c D N A ~tr tfc -§> - ' £ fi£ 

Qrt> 
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mm 

m&m^: 1 

mmo&z : 4 3 6 7 

5 mo^ •. -#« 

BE5U 

TATCCGCACC TCCGCTCCCA CCCGGCGCCT CGGCGCGCCC GCCCTCCG ATG CGC TCA 57 

Met Arg Ser 

10 ' 1 

GCG GCC GCA GCT CGT CGG AGT CCC GCG GTG GCC AGO GAG TCT CGC CGC 105 
Ala Ala Ala Ala Pro Arg Ser Pro Ala Val Ala Thr Glu Ser Arg Arg 

"■ 5 : " ; 10 . '. 15 ;'• 

TTC GCC GCA GCC AGG TGG CCC GGG TGG CGC TCG CTC CAG CGG CCG GCG 153 
15 Phe Ala Ala Ala Arg Trp Pro Gly Trp Arg Ser Leu Gin Arg Pro Ala 
20 25 30 35 

CGG CGG AGC GGG CGG GGC GGC GGT GGC GCG GCC CCG GGA CCG TAT CCC 201 
Arg Arg Ser Gly Arg Gly Gly Gly Gly Ala Ala Pro Gly Pro Tyr Pro 
40 45 50 

20 TCC GCC GCC CCT CCC CCG CCC GGC CCC GGC CCC CCT CCC TCC CGG CAG 249 
Ser Ala Ala Pro Pro Pro Pro Gly Pro Gly Pro Pro Pro Ser Arg Gin 

55 60 65 

AGC TCG CCT CCC TCC GCC TCA GAC TGT TTT GGT AGC AAC GGC AAC GGC 297 
Ser Ser Pro Pro Ser Ala Ser Asp Cys Phe Gly Ser Asn Gly Asn Gly 
25 70 75 80 

GGC GGC GCG TTT CGG CCC GGC TCC CGG CGG CTC CTT GGT CTC GGC GGG 345 
Gly Gly Ala Phe Arg Pro Gly Ser Arg Arg Leu Leu Gly Leu Gly Gly 
85 90 95 
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CCT GCC CGC CCC TTC GTC GTC GTC CTT CTC CCC CTC GCC AGC CCG GGC 393 

Pro Pro Arg Pro Phe Val Val Val Leu Leu Pro Leu Ala Ser Pro G I y 

100 105 110 115 

GGC CCT CCG GCC GCG CCA ACC CGC GCC TCC CCG CTC GGC GCC CGT GCG 441 

5 Ala Pro Pro Ala Ala Pro Thr Arg Ala Ser Pro Leu Gly Ala Arg Ala 

120 125 130 

TCC CCG CCG CGT TCC GGC GTC TCC TTG GCG CGC CCG GCT CCC GGC TGT 489 

Ser Pro Pro Arg Ser Gly Val Ser Leu Ala Arg Pro Ala Pro Gly Cys 
135 140 145 

10 CCC CGC CCG GCG TGC GAG CCG GTG TAT GGG CCC CTC ACC ATG TCG CTG 537 

Pro Arg Pro Ala Cys Glu Pro Val Tyr Gly Pro Leu Thr Met Ser Leu 

150 155 160 

AAG CCC CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAA 585 

Lys Pro Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
15 165 170 175 

CAG CAG CAG CAG CAG CAG CAG CAG CAG CCG CCG CCC GCG GCT GCC AAT 633 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Pro Pro Pro Ala Ala Ala Asn 

180 185 190 195 

GTC CGC AAG CCC GGC GGC AGC GGC CTT CTA GCG TCG CCC GCC GCC GCG 681 

20 Val Arg Lys Pro Gly Gly Ser Gly Leu Leu Ala Ser Pro Ala Ala Ala 

200 205 210 

CCT TCG CCG TCC TCG TCC TCG GTC TCC TCG TCC TCG GCC ACQ GCT CCC 729 

Pro Ser Pro Ser Ser Ser Ser Val Ser Ser Ser Ser Ala Thr Ala Pro 
215 220 225 

25 TCC TCG GTG GTC GCG GCG ACC TCC GGC GGC GGG AGG CCC GGC CTG GGC 777 

Ser Ser Val Val Ala Ala Thr Ser Gly Gly Gly Arg Pro Gly Leu Gly 

230 235 240 

AGA GGT CGA AAC AGT AAC AAA GGA CTG CCT CAG TCT ACG ATT TCT TTT 825 
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Arg Gly Arg Asn Ser Asn Lys Gly Leu Pro Gin Ser Thr lie Ser Phe 

245 250 255 

GAT GGA ATC TAT GCA AAT ATG AGG ATG GTT CAT ATA CTT ACA TCA GTT 873 
Asp Gly lie Tyr Ala Asn Met Arg Met Val His lie Leu Thr Ser Val 
5 260 265 270 275 

GTT GGC TCC AAA TGT GAA GTA CAA GTG AAA AAT GGA GGT ATA TAT GAA 921 
Val Gly Ser Lys Cys Glu Val Gin Val Lys Asn Gly Gly lie Tyr Glu 

280 285 290 

GGA GTT TTT AAA ACT TAC AGT CCG AAG TGT GAT TTG GTA CTT GAT GCC 969 
10 Gly Val Phe Lys Thr Tyr Ser Pro Lys Cys Asp Leu Val Leu Asp Ala 

295 300 305 

GCA CAT GAG AAA AGT ACA GAA TCC AGT TCG GGG CCG AAA CGT GAA GAA 1017 
Ala His Glu Lys Ser Thr Glu Ser Ser Ser Gly Pro Lys Arg Glu Glu 
310 315 320 

15 ATA ATG GAG AGT ATT TTG TTC AAA TGT TCA GAC TTT GTT GTG GTA CAG 1065 
I le Met Glu Ser I le Leu Phe Lys Cys Ser Asp Phe Val Val Val Gin 

325 330 335 J 

TTT AAA GAT ATG GAC TCC AGT TAT GCA AAA AGA GAT GCT TTT ACT GAC 1113 
Phe Lys Asp Met Asp Ser Ser Tyr Ala Lys Arg Asp Ala Phe Thr Asp 
20 340 345 350 355 

TCT GCT ATC AGT GCT AAA GTG AAT GGC GAA CAC AAA GAG AAG GAC CTG 1161 
Ser Ala I le Ser Ala Lys Val Asn Gly Glu His Lys Glu Lys Asp Leu 

360 365 370 

GAG CCC TGG GAT GCA GGT GAA CTC ACA GCC AAT GAG GAA CTT GAG GCT 1209 
25 Glu Pro Trp Asp Ala Gly Glu Leu Thr Ala Asn Glu Glu Leu Glu Ala 

375 380 385 

TTG GAA AAT GAC GTA TCT AAT GGA TGG GAT CCC AAT GAT ATG TTT CGA 1257 
Leu Glu Asn Asp Val Ser Asn Gly Trp Asp Pro Asn Asp Met Phe Arg 
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390 395 400 

TAT AAT GAA GAA AAT TAT GGT GTA GTG TCT ACG TAT GAT AGO AGT TTA 1305 

Tyr Asn Glu Glu Asn Tyr G I y Val Val Ser Thr Tyr Asp Ser Ser Leu 

405 410 415 

5 TCT TCG TAT ACA GTG CCC TTA GAA AGA GAT AAC TCA GAA GAA TTT TTA 1353 

Ser Ser Tyr Thr Val Pro Leu Glu Arg Asp Asn Ser Glu Glu Phe Leu 

420 425 430 435 

AAA CGG GAA GCA AGG GCA AAC CAG TTA GCA GAA GAA ATT GAG TCA AGT 1401 

Lys Arg Glu Ala Arg Ala Asn Gin Leu Ala Glu Glu I le Glu Ser Ser 
10 440 445 450 

GCC CAG TAG AAA GCT CGA GTG GCC CTG GAA AAC GAT GAT AGG AGT GAG 1449 

Ala Gin Tyr Lys Ala Arg Val Ala Leu Glu Asn Asp Asp Arg Ser Glu 

455 460 465 

GAA GAA AAA TAC ACA GCA GTT CAG AGA AAT TCC AGT GAA CGT GAG GGG 1497 

15 Glu Glu Lys Tyr Thr Ala Val Gin Arg Asn Ser Ser Glu Arg Glu Gly 
470 475 480 

1 " 

CAC AGC ATA AAC ACT AGG GAA AAT AAA TAT ATT CCT CCT GGA CAA AGA 1 545 

His Ser lie Asn Thr Arg Glu Asn Lys Tyr lie Pro Pro Gly Gin Arg 

485 490 495 

20 AAT AGA GAA GTC ATA TCC TGG GGA AGT GGG AGA CAG AAT TCA CCG CGT 1593 

Asn Arg Glu Val I le Ser Trp Gly Ser Gly Arg Gin Asn Ser Pro Arg 

500 505 510 515 

ATG GGC CAG CCT GGA TCG GGC TCC ATG CCA TCA AGA TCC ACT TCT CAC 1 641 

Met Gly Gin Pro Gly Ser Gly Ser Met Pro Ser Arg Ser Thr Ser His 

25 520 525 530 

ACT TCA GAT TTC AAC CCG AAT TCT GGT TCA GAC CAA AGA GTA GTT AAT 1689 

Thr Ser Asp Phe Asn Pro Asn Ser Gly Ser Asp Gin Arg Val Val Asn 
535 540 545 
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GGA GGT GTT CCC TGG CCA TCG CCT TGC CCA TCT CCT TCC TCT CGC CCA 1737 
Gly Gly Val Pro Trp Pro Ser Pro Cys Pro Ser Pro Ser Ser Arg Pro 

550 555 560 

CCT TCT CGC T AC CAG TCA GGT CCC AAC TCT CTT CCA CCT CGG GCA GCC 1785 
5 Pro Ser Arg Tyr Gin Ser Gly Pro Asn Ser Leu Pro Pro Arg Ala Ala 
565 570 575 

ACC CCT ACA CGG CCG CCC TCC AGG CCC CCC TCG CGG CCA TCC AGA CCC 1833 
Thr Pro Thr Arg Pro Pro Ser Arg Pro Pro Ser Arg Pro Ser Arg Pro 
580 585 590 595 

|() CCG TCT CAC CCC TCT GCT CAT GGT TCT CCA GCT CCT GTC TCT ACT ATG 1881 
Pro Ser His Pro Ser Ala His Gly Ser Pro Ala Pro Val Ser Thr Met 

600 605 610 

CCT AAA CGC ATG TCT TCA GAA GGG CCT CCA AGG ATG TCC CCA AAG GCC 1929 
Pro Lys Arg Met Ser Ser Glu Gly Pro Pro Arg Met Ser Pro Lys Ala 
15 615 620 625 

CAG CGA CAT CCT CGA AAT CAC AGA GTT TCT GCT GGG AGG GGT TCC ATA 1977 
* ■ Gin Arg His Pro Arg Asn His Arg Val Ser Ala Gly Arg Gly Ser lie 
630 635 640 

TCC AGT GGC CTA GAA TTT GTA TCC CAC AAC CCA CCC AGT GAA GCA GCT 2025 
20 Ser Ser Gly Leu Glu Phe Val Ser His Asn Pro Pro Ser Glu Ala Ala 
645 650 655 

ACT CCT CCA GTA GCA AGG ACC AGT CCC TCG GGG GGA ACG TGG TCA TCA 2073 
Thr Pro Pro Val Ala Arg Thr Ser Pro Ser Gly Gly Thr Trp Ser Ser 
660 665 670 " 675 

25 GTG GTC AGT GGG GTT CCA AGA TTA TCC CCT AAA ACT CAT AGA CCC AGG 2121 
Val Val Ser Gly Val Pro Arg Leu Ser Pro Lys Thr His Arg Pro Arg 

680 685 690 

TCT CCC AGA CAG AAC AGT ATT GGA AAT ACC CCC AGT GGG CCA GTT CTT ,2169 
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Ser Pro Arg Gin Asn Ser lie Gly Asn Thr Pro Ser Gly Pro Val Leu 

695 700 705 

GCT TCT CCC CAA GCT GGT ATT ATT CCA ACT GAA GCT GTT GCC ATG CCT 2217 

Ala Ser Pro Gin Ala Gly Me lie Pro Thr Glu Ala Val Ala Met Pro 
5 710 715 720 

ATT CCA GCT GCA TCT CCT ACG CCT GCT AGT CCT GCA TCG AAC AGA GCT 2265 

lie Pro Ala Ala Ser Pro Thr Pro Ala Ser Pro Ala Ser Asn Arg Ala 

725 730 735 

GTT ACC CCT TCT AGT GAG GCT AAA GAT TCC AGG CTT CAA GAT CAG AGG 2313 

10 Val Thr Pro Ser Ser Glu Ala Lys Asp Ser Arg Leu Gin Asp Gin Arg 

740 745 750 755 

CAG AAC TCT CCT GCA GGG AAT AAA GAA AAT ATT AAA CCC AAT GAA ACA 2361 

Gin Asn Ser Pro Ala Gly Asn Lys Glu Asn lie Lys Pro Asn Glu Thr 
760 765 770 

15 TCA CCT AGC TTC TCA AAA GCT GAA AAC AAA GGT ATA TCA CCA GTT GTT 2409 

Ser Pro Ser Phe Ser Lys Ala Glu Asn Lys Gly lie Ser Pro Val Val 

775 780 785 

TCT GAA CAT AGA AAA CAG ATT GAT GAT TTA AAG AAA TTT A AG AAT GAT 2457 

Ser Glu His Arg Lys Gin lie Asp Asp Leu Lys Lys Phe Lys Asn Asp 
20 790 795 800 

TTT AGG TTA CAG CCA AGT TCT ACT TCT GAA TCT ATG GAT CAA CTA CTA 2505 

Phe Arg Leu Gin Pro Ser Ser Thr Ser Glu Ser Met Asp Gin Leu Leu 

805 810 815 

AAC AAA AAT AGA GAG GGA GAA AAA TCA AGA GAT TTG ATC AAA GAC AAA 2553 

25 Asn Lys Asn Arg Glu Gly Glu Lys Ser Arg Asp Leu lie Lys Asp Lys 

820 825 830 835 

ATT GAA CCA AGT GCT AAG GAT TCT TTC ATT GAA AAT AGC AGC AGC AAC 2601 

lie Glu Pro Ser Ala Lys Asp Ser Phe lie Glu Asn Ser Ser Ser Asn 
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840 845 850 

TGT ACC AGT GGC AGC AGC AAG CCG AAT AGO GCC AGC ATT TGC OCT TCA 2649 
Cys Thr Ser Gly Ser Ser Lys Pro Asn Ser Pro Ser Me Ser Pro Ser 
855 860 865 

5 ATA CTT AGT AAC ACG GAG CAC AAG AGG GGA CCT GAG GTC ACT TGC CAA 2697 
lie Leu Ser Asn Thr Glu His Lys Arg Gly Pro Glu Val Thr Ser Gin 

870 875 880 

GGG GTT CAG ACT TCC AGC CCA GCA TGT AAA CAA GAG AAA GAC GAT AAG 2745 
Gly Val Gin Thr Ser Ser Pro Ala Cys Lys Gin Glu Lys Asp Asp Lys 
10 885 890 895 

GAA GAG AAG AAA GAC GCA GCT GAG CAA GTT AGG AAA TCA ACA TTG AAT 2793 
Glu Glu Lys Lys Asp Ala Ala Glu Gin Val Arg Lys Ser Thr Leu Asn 
900 905 910 915 

CCC AAT GCA AAG GAG TTC AAC CCA CGT TCC TTC TCT CAG CCA AAG CCT 2841 
15 Pro Asn Ala Lys Glu Phe Asn Pro Arg Ser Phe Ser Gin Pro Lys Pro 

920 925 930 

TCT ACT ACC CCA ACT TCA CCT CGG CCT CAA GCA CAA CCT AGC CCA TCT 2889 
Ser Thr Thr Pro Thr Ser Pro Arg Pro Gin Ala Gin Pro Ser Pro Ser 
935 940 945 

20 ATG GTG GGT CAT CAA CAG CCA ACT CCA GTT TAT ACT CAG CCT GTT TGT 2937 
Met Val Gly His Gin Gin Pro Thr Pro Val Tyr Thr Gin Pro Val Cys 

950 955 960 

TTT GCA CCA AAT ATG ATG TAT CCA GTC CCA GTG AGC CCA GGC GTG CAA 2985 
Phe Ala Pro Asn Met Met Tyr Pro Val Pro Val Ser Pro Gly Val Gin 
25 965 970 975 

CCT TTA TAG CCA ATA CCT ATG ACG CCC ATG CCA GTG AAT CAA GCC AAG 3033 
Pro Leu Tyr Pro lie Pro Met Thr Pro Met Pro Val Asn Gin Ala Lys 
980 985 990 995 



} 

i 
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ACA TAT AGA GCA GTA CCA AAT ATG CCC CAA CAG CGG CAA GAC CAG CAT 3081 
Thr -Tyr Arg Ala Val Pro Asn Met Pro Gin Gin Arg Gin Asp Gin His 

1000 1005 1010 

CAT CAG AGT GCC ATG ATG CAC CCA GCG TCA GCA GCG GGC CCA CCG ATT 3129 
His Gin Ser Ala Met Met His Pro Ala Ser Ala Ala Gly Pro Pro lie 

1015 1020 1025 

GCA GCC ACC CCA CCA GCT TAC TCC ACG CAA TAT GTT GCC TAC AGT CCT 3177 
Ala Ala Thr Pro Pro Ala Tyr Ser Thr Gin Tyr Val Ala Tyr Ser Pro 

1030 1035 1040 

CAG CAG TTC CCA AAT CAG CCC CTT GTT CAG CAT GTG CCA CAT TAT CAG 3225 
Gin Gin Phe Pro Asn Gin Pro Leu Val Gin His Val Pro His Tyr Gin 

1045 1050 1055 

TCT CAG CAT CCT CAT GTC TAT AGT CCT GTA ATA CAG 
Ser Gin His Pro His Val Tyr Ser Pro Val lie Gin 
1060 1065 1070 

ATG ATG GCA CCA CCA ACA CAC GCC CAG CCT GGT TTA 
Met Met Ala Pro Pro Thr His Ala Gin Pro Gly Leu 

1080 1085 
GCA ACT CAG TAC GGG GCT CAT GAG CAG ACG CAT GCG 
Ala Thr Gin Tyr Gly Ala His Glu Gin Thr His Ala 

1095 1100 
CCC AAA TTA CCA TAC AAC AAG GAG ACA AGC CCT TCT 
Pro Lys Leu Pro Tyr Asn Lys Glu Thr Ser Pro Ser 

1110 1115 
ATT TCC ACG GGC TCC CTT GCT CAG CAG TAT GCG CAC 
He Ser Thr Gly Ser Leu Ala Gin Gin Tyr Ala His 

1125 1130 1135 

CTG CAC CCA CAT ACT CCA CAC CCT CAG CCT TCA GCT ACC CCC ACT GGA 3513 



GGT AAT GCT AGA 3273 
Gl y Asn Ala Arg 
1075 

GTA TCT TCT TCA 3321 
Val Ser Ser Ser 
1090 

ATG TAT GCA TGT 3369 
Met Tyr Ala Cys 
1105 

TTC TAC.TTT GCC 3417 
Phe Tyr Phe Ala 
1120 

CCT AAC GCT ACC 3465 
Pro Asn Ala Thr 



3657 



3705 
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Leu His Pro His Thr Pro His Pro Gin Pro Ser Ala Thr Pro Thr Gly 
1140 1145 1150 1155 

GAG GAG CAA AGC CAA CAT GGT GGA AGT CAT CCT GCA CCC AGT CCT GTT 3561 
Gin Gin Gin Ser Gin His Gly Gly Ser His Pro Ala Pro Ser Pro Val 
5 1160 1165 1170 

GAG GAG CAT GAG CAC CAG GCC GCC CAG GOT CTC CAT CTG GCC AGT CCA 3609 
Gin His His Gin His Gin Ala Ala Gin Ala Leu His Leu Ala Ser Pro 

1175 1180 1185 

GAG CAG CAG TCA GCC ATT TAC CAC GCG GGG CTT GCG CCA ACT CCA CCC 
l0 Gin Gin Gin Ser Ala I le Tyr His A.a Gly Leu Ala Pro Thr Pro Pro 
1190 1195 1200 

TCC ATG ACA CCT GCC TCC AAC ACG CAG TCG CCA CAG AAT AGT TTC CCA 
Ser Met Thr Pro Ala Ser Ash Thr Gin Ser Pro Gin Asn Ser Phe Pro 
1205 1210 1215 

15 GCA GCA CAA CAG ACT GTC TTT ACG ATC CAT CCT TCT CAC GTT CAG CCG 
Ala Ala Gin Gin Thr Val Phe Thr Me His Pro Ser His Val Gin Pro 

I ' ioos 1230 1235 

1220 1225 '^ ou 

GCG TAT ACC AAC CCA CCC CAC ATG GCC CAC GTA CCT CAG GGT CAT GTA 
Ala Tyr Thr A sn Pro Pro His Met Ala His Val Pro Gin Ala His Val 
20 1240 1245 1250 

CAG TCA GGA ATG GTT CCT TCT CAT CCA ACT GCC CAT GCG CCA ATG ATG 
Gin Ser Gly Met Val Pro Ser His Pro Thr Ala His Ala Pro Met Met 

1255 1260 _J 1265 

GTA ATG ACG ACA CAG CCA CCC GGC GGT CCC CAG GCC GCC CTC GCT CAA 
2n Leu Met Thr Thr Gin Pro Pro Gly Gly Pro Gin Ala Ala Leu Ala Gin 
1270 1275 1280 

AGT GCA CTA CAG CCC ATT CCA GTC TCG ACA ACA GCG CAT TTC CCC TAT 
Ser Ala Leu Gin Pro Me Pro Val Ser Thr Thr Ala His Phe Pro Tyr 
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380T 



3849 



3897 



3945 
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1285 



1290 



1295 



ATG ACG CAC GCT TCA GTA CAA GCC CAC CAC CAA GAG CAG TTG 



3987 



Met Thr His Pro Ser Val Gin Ala His His Gin Gin Gin Leu 



1300 



1305 



1310 



TAAGGGTGCC CTGGAGGAAC CGAAAGGCCA AATTCCCTCC TCCCTTCTAC TGCTTCTACC 4047 

AACTGGAAGC ACAGAAAACT AGAATTTCAT TTATTTTGTT TTTAAAATAT ATATGTTGAT 4107 

TTCTTGTAAC ATCCAATAGG AATGCTAACA GTTCACTTGC AGTGGAAGAT ACTTGGACCG 4167 

AGTAGAGGCA TTTAGGAACT TGGGGGCTAT TCCATAATTC CATATGCTGT TTCAGAGTCC 4227 

CGCAGGTACC CCAGCTCTGC TTGCCGAAAC TGGAAGTTAT TTATTTTTTA ATAACCCTTG 4287 

AAAGTCATGA ACACATCAGC TAGCAAAAGA AGTAACAAGA GTGATTCTTG CTGCTATTAC 4347 

TGCTAAAAAA AAAAAAAAAA 4367 

mvm^ 2 

mm<D3:Z : 203 
iS£i]£>32 : m& 

CACCACCAGC AACAGCAACA GCAGCAGCAG CAGCAGGAGC AGCAGCAGCA GCAGCAGCAG 60 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 120 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 180 

CAGCATCACG GAAACTCTGG GCC 203 

mvm^ 3 

: 20 

mmom mm 
mo®. : -» 
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CACCACCAGC AACAGCAACA 20 

mm^ : 4 

%ZmO&$ : 20 

mom. : -*ta 

hTtt'n v- : MM* 

GGCCCAGAGT TTCCGTGATG 20 
ffi$l]0>fi£ : 165 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGG AGCAGCAGCA GCAGCAGCAG 6( 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG 1 2( 

CAGCAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAG 16 ' 

mwmt : 21 

CCCTCACCAT GTCGCTGAAG C 21 
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IBJUS-^ : 7 
ie*U<Dft£ : 19 
®S^(JC0M : ffik 
m<D$k : -»T 
h^'P v- : Ml* 
IBM 

CGACGCTAGA AGGCCGCTG 



mm^- : 8 

IH$lJ0^^ : 19 
ffifD'St : -« 

CTTGCGGACA TTGGCAGGC 

m&m^ : 9 

IB?ljtf>ll£ : 27 
SKDlJt : 

TTCTCTCAGC CAAAGCCTTC TACTACC 



mm ^- : 10 

IE?i]<DS^ : 19 
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TATCCGCAGC TCC6CTCCC 19 

mm^ : 11 

lB5")(Dft$ : 20 
h^P v- : !£' W 

AGCCGGGCCG AAACGCGCCG 20 



1 
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1 TATCCGCACCTCCGCTCCCACCCGGCGCCTCGGCGCGCCCGCCCTCCGATGCGCTCAGCG 

1 F-1006 M R S A 

6 1 GCCGCAGCTCCTCGGAGTGCCGCGGTGGCCACCGAGTCTCGCCGCTTCGCCGCAGCCAGG 

5 A A A P R S P A VAT E S R R F A A A R 

121 TGGCCCGGGTGGCGCTCGCTCCAGCGGCCGGCGCGGCGGAGCGGGCGGGGCGGCGGTGGC 

25 W P G W R S h Q R P A R R S G R G G G G 

181 GCGGCCCCGGGACCGTATCCCTCCGCCGCCCCTCCCCCGCCCGGCCCCGGCCCCCCTCCC 

AS A A P G P Y P S A A P P P P G P G P P P 

241 TCCCGGCAGAGCTCGCCTCCCTCCGCCTCAG ACTGTTTTGGTAGC AACGGC AACGGCGGC 

65 S R Q S S P P S A S D C F G S N G N G G 

3 01 GGCGCGTTTCGGCCCGG'CTCCC'GGCGGCTCCTTGGTCTCGGCGGGCCTCCCCGCCCCTTC 

— R-1002 

85 G A F R P G S R R L L G L G GPP R P F 

361 GTCGTCGTCCTTCTCCCCCTCGCGAGCCCGGGCGCCCCTCCGGCCGCGCC AACCCGCGCc: 

105 V V V L L P L A S P G A P P A A P T R A . 

421 TCCCCGCTCGGCGCCCGTGCGTCCCCGCCGCGTTCCGGCGTCTCCTTGGCGCGCCCGGCT 

125 S P L G A R ASPP RSGVSL A R P A 

481 CCCGGCTGTCCCCGGCCGGCGTGCGAGCCGGTGTATGGGCCCCTCACCATGTCGCTGAAG 

F-1 

145 P G C P R P A C E P V Y G P L T M S L K 

5 4 1 J3CCCAGC AGCAGCAGGAGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCAGCAGCAGCAG 

165 P Q Q Q Q Q Q Q Q Q Q 0 Q Q Q Q Q Q Q Q 

601 CAGGAGCAGCAGCCGCCGGCCGCGGGTGGCAATGTGCGCAAGCCCGGCGGCAGCGGCCTT 

— : — R-2 — — 

185 Q Q Q Q P P P A A A N V R K P G G S G L 

66 1 CTAGCGTCGCCCGGCGCCGCGCCTTCGGCGTCGTCGTGCTCGGTCTCCTCGTCCTCGGCC 

205 L A S P A A APS P S S S S V S S S S A 

7 21 ACGGCTCGCTCCTCGGTGGTGGCGGCGAGCTCCGGCGGCGGGAGGCCCGGCCTGGGCAGA 

225 JT A P S S V V A A T 5 G G G R P G L G R 

7 81 GGTCGAAACAGTAACAAAGGACTGCCTCAGTCTACGATTTCTTTTGATGGAATCTATGCA 

245 G R N S N K G L P Q S T I S F D G I Y A 

841; AATATGAGGATGGTTCATATACTTACATCAGTTGTTGGCTCCAAATGTGAAGTACAAGTG 

265 N M R M V H I L T S V V G S K C E V Q V 

901 AAAAATGGAGGTATATATGAAGGAGTTTTTAAAACTTACAGTCCGAAGTGTGATTTGGTA 

285 K N G G I Y E G V F K T Y S P K C D h V 

96 1 CTTGATGCCGCACATGAGAAAAGTAGAGAATCCAGTTCGGGGCCGAAACGTGAAGAAATA 

305 L D A A H E K S T E S S S G P K R E E I 

1021 ATGGAGAGTATTTTGTTC AAATGTTC AGACTTTGTTGTGGTACAGTTTAAAGATATGGAC 

325 M E S I L F K C S D F V V V Q F K D M D 

10 8 1 TC C AGTT ATG C AAAAAG AG ATGCTTTTAC TG AC TC TG CT ATC AGTGCT AAAGT G AATGG C 

345 S S Y A K R D A F T D S A I S A K V N G 

1141 GAAC AC AAAGAGAAGGACCTGGAGCCCTGGGATGCAGGTGAACTC ACAGCC AATGAGGAA 

365 E H K E K D L E P W D A G E L T A N E E 

1201 CTTGAGGCTTTGGAAAATGACGTATGTAATGGATGGGATCCCAATGATATGTTTeGATAT 

385 It E A L E N D V S N G W D P N D M F R V 

1261 AATGAAGAAAATTATGGTGTAGTGTCTACGTATGATAGC AGTTTATCTTCGTATAC AGTG 

405 M E E N Y G V V S T Y D ' S S L S S Y T V 



WO 98/18920 



PCT/JP97/03946 



2/6 

1321 CCCTTAGAAAGAGATAACTCAGAAGAATTTTTAAAACGGGAAGCAAGGGCAAACCAGTTA 

425 P L E R D N S E E F 1. K R E A R A N Q L 
13 81 GC AG AAGAAATTGAGTC AAGTGC C C AGTAC AAAG C TCGAGTGGCCC TGG AAAA C G A TGA T 

445 A E E I E S SAQ YKA R V A L EN DD 
1441 AGG AGTG AG G AAG AAAAAT ACAC AGC AGTTC AG AG AAATTC C AGTG AAC GTG AGGG G C AC 

465 R S E E E K Y T A V Q R N S S E R E G H 
150 1 AGGATAAACACTAGGGAAAATAAATATATTCCTCCTGGACAAAGAAATAGAGAAGTCATA 

48S S I N T R E N K Y I P P G Q R N R E V I 
1561 TCCTGGGGAAGTGGGAGACAGAATTCACCGCGTATGGGCCAGCCTGGATCGGGCTCCATG '. 

505 S W G S G R Q N S P R M G Q P G S G S M 
1621 C C ATC AAG ATG G ACTTC TC AC AC TTC AG ATTTC AAC CCG AATTCTGGTTC AG AC C AAAGA 

525 P S R S T S H T S O F W P ' N S G S D Q R 

16 81 GTAGTTAATGGAGGTGTTCCCTGGCCATCGCCTTGCCCATCTCCTTCCTCTCGCCCACCT 
545 V V N G G V P W P S P C P £ P S S R P P 

17 41 TCTCGCTACCAGTCAGGTCCCAACTCTCTTCCACCTCGGGCAGCCACCCCTACACGGCCG 
565 S R Y Q S G P N S L P P R A A T PTE P 

1801 CCCTCCAGGCCCCCCTCGCGGCCATCCAGACCCCCGTCTCACCCCTCTGCTCATGGTTCT 
585 P S R P P S R P S R P P S H P S A H G S 

1 8 61 CCAGCTCCTGTCTCTACTATGCCTAAACGCATGTCTTCAGAAGGGCCTCCAAGGATGTCC 
605 P A P V S T M P K R M S S E G P P R M S 

1921 CCAAAGGCCCAGCGACATCCTCGAAATCACAGAGTTTCTGCTGGGAGGGGTTCCATATCC 

625 P K A Q R H P R N H R V S A G R G S I S 
1981 AGTGGCCTAGAATTTGTATCCCACAACCCACCCAGTGAAGCAGCTACTCCTCCAGTAGCA 

645 S G L E F V S H N P P SEA A T P P V A 
2 041 AGGACCAGTCCCTCGGGGGGAACGTGGTCATCAGTGGTCAGTGGGGTTCCAAGATTATCC 

665 R T S P S G G T W S S V V S G V P R L S 
2101 CCTAAAACTCATAGACCCAGGTCTCCCAGACAGAACAGTATTGGAAATACCCCCAGTGGG 

685 P K T K R P R S P R Q N SIGNTPSG 
2161 CCAGTTCTTGCTTCTCCCCAAGCTGGTATTATTCCAACTGAAGCTGTTGCCATGCCTATT 

705 P V L A S P Q A G I I P T E A V A M P I 
2 22 1 CCAGCTGGATCTCCTACGCGTGCTAGTCCTGCATCGAACAGAGCTGTTACCCCTTCTAGT 

725 P A A S P T P A S P A S N R A V T P S S 
2281 GAGGCTAAAGATTCCAGGCTTCAAGATCAGAGGCAGAACTCTCCTGCAGGGAATAAAGAA 

745 E A K D S R L Q D Q R Q N S P A G N K E 
2341 AATATTAAACCCAATGAAAC ATCACCTAGCTTCTCAAAAGCTGAAAACAAAGGTATATCA 

765 N I K P NET S P S F S K A E N K G I S 
2401 CCAGTTGTTTCTGAACATAGAAAACAGATTGATGATTTAAAGAAATTTAAGAATGATTTT 

785 P V V S E H R K Q I D D h K K F K N D F 
2461 AGGTTAC AGCCAAGTTCTACTTCTGAATCTATGGATC AACTACTAAACAAAAATAGAGAG 

805 R L Q P S S T S E S M D Q L L N K N R E 
2521 GGAGAAAAATCAAGAGATTTGATCAAAGACAAAATTGAACCAAGTGCTAAGGATTCTTTC 

825 G E K S R D L I K D K I E P S A K D S F 
25 8 1 ATTGAAAATAGCAGCAGCAAGTGTACCAGTGGCAGCAGCAAGCCGAATAGCCCCAGCATT„ 

84 5 I E N S S S N C T "s G S S K P N S P S I 
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2 64 1 TCCCCTTCAATACTTAGTAACACGGAGCACAAGAGGGGACCTGAGGTCACTTCCCAAGGG 
865 S P S I h S N T E H K R G P E V T S Q G 

27 01 GTTC AGAC TTC C AGC CC AGCATGTAAAC AAGAG AAAG AC G ATAAGG AAG AG AAG AAAGAG 
885 V Q T $ S P A C K Q E K D D K E E K K D 

27 61 GCAGCTGAGCAAGTTAGGAAATCAACATTGAATCCC AATGCAAAGGAGTTCAACCC ACGT 

905 A A E Q V R K S T L N P.N A K E F N P R 

2821 TCCTTCTCTCAGCC AAAGCCTTCTACTACCCC AACTTC ACCTCGGCCTC AAGCACAACCT 
F-13 : " ~ 

925 S F S Q P K P S T T P T S P R P Q A Q P 
2 881 AGCCCATGTATGGTGGGTCATC AACAGCCAACTCCAGTTTATACTCAGCCTGTTTGTTTT 
945 S P S M V G H Q Q P T P V Y T Q P V C F 

2 941 GCACGAAATATGATGTATCCAGTCCCAGTGAGCCCAGGCGTGCAACCTTTATAGCCAATA 
9 65 A P N M M Y P V P V S P G V Q P L Y P I 

3 0 01 CCTATGACGCCCATGCCAGTGAATCAAGCCAAGACATATAGAGCAGTACCAAATATGCCC 
985 P M T P M P V N Q A K T Y R A VPN M P 

3061 C AACAGCGGCAAGACCAGCATC ATC AG AGTGC C ATGATGC AC CC AGC GTC AGC AGC GGGC 
1005 Q Q R Q D Q H H Q S A M M H P A S A A G 
3121 CCACCGATTGCAGCCACCCCACCAGCTTACTCC ACGCAATATGTTGCCTAC AGTCCTCAG 
1025 P P I A A T P P A Y S T Q Y V A Y S P Q 
3 18 1 C AGTT CC C AAATC AGCC C CTTGTTC AGC AT GTG C C AC ATT ATC AGTCTC AGC ATC C TC AT 
1045 Q F P N Q P L V Q H V P H Y Q S Q H P H 

32 41 GTCTATAGTCCTGTAATACAGGGTAATGCTAGAATGATGGC ACC ACC AAC ACACGCCCAG 
1065 V Y S P V I Q G N A R M M A P P T H A Q 

3 3 0 1 CCTGGTTTAGTATCTTCTTCAGCAACTCAGTACGGGGCTCATGAGCAGACGCATGCGATG 
1085 P G L V S S SAT Q Y G A H E Q T H A M 

33 61 TATGCATGTCCCAAATTACCATACAACAAGGAGACAAGCCCTTCTTTCTACTTTGCCATT 
1105 Y A C PKL PYNKETS PS F Y F A I 
3421 TCCACGGGCTCCCTTGCTCAGCAGTATGCGCACCCTAACGCTACCCTGCACCCACATACT 
1125 S T G S L A Q Q Y A H P N A T L H P H T 

3 4 81 CCACACCCTCAGCCTTCAGCTACCCCCACTGGACAGCAGCAAAGCCAACATGGTGGAAGT 
1145 P H P Q P S A T P T G Q Q Q S Q H G G S 
3541 CATCCTGCACCCAGTCCTGTTCAGCACCATCAGCACCAGGCCGCCCAGGCTCTCCATCTG 
1165 H P A P S P V Q H H Q H Q A A Q A L M L 
3601 GCCAGTCCACAGCAGCAGTCAGCCATTTACCACGCGGGGCTTGCGCCAACTCCACCCTCC 
1185 A S P Q Q Q S A I Y H A G L A P T P P S 
3 661 ATGACACCTGCCTCCAACACGCAGTCGCCACAGAATAGTTTCCCAGCAGCACAACAGACT 
1205 M T P A S N T Q S P Q N S F P A A Q Q T 
3 721 GTCTTTACGATCCATC CTTCTCACGTTCAGC CGGC GTATAC CAACCC AC CC CACATGGCC 
1225 V F T I H P SHVQPAYT N P P H M A 
3781 CACGTACCTCAGGCTCATGTACAGTCAGGAATGGTTCCTTCTC ATCCAACTGCCCATGCG 
1245 H V P Q A H V Q S G M V P S H P T A H A 
3841 CCAATGATGCTAATGACGACACAGCCACCCGGCGGTCCCCAGGCCGCCCTCGCTCAAAGT 
1265 P M M L M T T Q P P G G P Q A A L A Q S 
3 901 GCACTACAGCCCATTCCAGTCTCGACAACAGCGCATTTCCCCTATATGACGCACCCTTCA 
1285 A L Q P I P V S T T A H F P Y M T H P S 
3 9 61 GTACAAGCCCACCACCAACAGCAGTTGTAAGGCTGCCCTGGAGGAACCGAAAGGCCAAAT 

1305 V Q A H H Q Q Q L * 
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4 0 21 TCCCTCCTCCCTTCTACTGGTTCTACCAACTGGAAGC ACAGAAAACTAGAATTTCATTTA 
4 0 81 TTTTGTTTTTAAAATATATATGTTGATTTCTTGTAACATCCAATAGGAATGCTAACAGTT 
4141 'CACTTGC AGTGGAAGATACTTGGACCGAGTAGAGGCATTTAGGAACTTGGGGGCTATTCC 
42 01 ATAATTCCATATGCTGTTTCAGAGTCCCGCAGGTACCCCAGCTCTGCTTGCCGAAACTGG 
42 61 AAGTTATTTATTTTTTAATAACCCTTGAAAGTGATGAACACATCAGCTAGC AAAAG AAGT 
4321 AACAAGAGTGATTCTTGCTGCTATTACTGCT (A) „ 
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